Discriminative Learning of Feature Functions of Generative Type in Speech Translation

Authors

  • Xiaodong He
  • Li Deng
Abstract

The speech translation (ST) problem can be formulated as a log-linear model with multiple features that capture different levels of dependency between the input speech observation and the output translation. However, while the log-linear model itself is discriminative in nature, many of the feature functions are derived from generative models, which are usually estimated by conventional maximum likelihood estimation. In this paper, we first present the formulation of the ST problem as a log-linear model with a plurality of feature functions. We then describe a general discriminative learning framework for training these generative features based on a technique called growth transformation (GT). The proposed approach is evaluated on the IWSLT spoken language translation benchmark. Our experimental results show that the proposed method leads to a significant improvement in translation quality. The proposed method also achieves fast and stable convergence.

Speech translation (ST) takes the source speech signal as input and produces as output the translated text of that utterance in another language. It can be viewed as automatic speech recognition (ASR) and machine translation (MT) in tandem. Like many other machine learning problems, the ST problem can be modeled by a log-linear model with multiple features that capture different dependencies between the input speech observation and the output translation. Although the log-linear model itself is discriminative, many of the feature functions, such as the scores of ASR outputs, are still derived from generative models. Further, these features are usually trained by conventional maximum likelihood estimation. In this paper, we propose a general framework of discriminative training for these generative features based on a technique called growth transformation (GT). The proposed approach is evaluated on the IWSLT spoken language translation benchmark. Our experimental results show that the proposed method leads to a significant improvement in translation performance. They also show that the proposed GT-based optimization method achieves fast and stable convergence.
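
The log-linear formulation described in the abstract can be illustrated with a minimal sketch. It assumes each candidate translation carries a few log-domain feature scores; the feature names (`asr_log_prob`, `tm_log_prob`, `lm_log_prob`), the weights, and the toy values are hypothetical and are not taken from the paper. The sketch only shows how weighted features combine into a posterior over candidates; it does not implement the growth transformation training itself.

```python
import math


def log_linear_posteriors(hypotheses, weights):
    """Posterior of each hypothesis under a log-linear model.

    hypotheses: list of dicts, feature name -> log-domain feature value
    weights:    dict, feature name -> feature weight (hypothetical values)
    """
    # Weighted sum of log-domain features for every hypothesis.
    scores = [
        sum(weights[name] * value for name, value in hyp.items())
        for hyp in hypotheses
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(scores)
    exp_scores = [math.exp(s - top) for s in scores]
    total = sum(exp_scores)
    return [e / total for e in exp_scores]


def best_hypothesis(hypotheses, weights):
    """Index of the hypothesis with the highest posterior."""
    posteriors = log_linear_posteriors(hypotheses, weights)
    return max(range(len(hypotheses)), key=posteriors.__getitem__)


# Toy two-candidate list with made-up scores, for illustration only.
weights = {"asr_log_prob": 1.0, "tm_log_prob": 0.8, "lm_log_prob": 0.5}
hypotheses = [
    {"asr_log_prob": -4.0, "tm_log_prob": -3.0, "lm_log_prob": -6.0},
    {"asr_log_prob": -5.0, "tm_log_prob": -1.0, "lm_log_prob": -4.0},
]
posteriors = log_linear_posteriors(hypotheses, weights)
best = best_hypothesis(hypotheses, weights)
```

In this toy setup the second candidate wins despite its lower ASR score, because the translation and language model features outweigh it. In a sketch like this the weights are fixed by hand, whereas the abstract concerns training the generative models that produce the feature values.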

Similar resources

Discriminative Models for Semi-Supervised Natural Language Learning

An interesting question surrounding semisupervised learning for NLP is: should we use discriminative models or generative models? Despite the fact that generative models have been frequently employed in a semi-supervised setting since the early days of the statistical revolution in NLP, we advocate the use of discriminative models. The ability of discriminative models to handle complex, high-di...

FEAST at Play: Feature ExtrAction using Score function Tensors

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we build upon a novel framework called FEAST(Feature ExtrAction using Score function Tensors) which incorporates generative models for discriminative learning. FEAST considers a novel class of matrix and tensor-valued featur...

Score Function Features for Discriminative Learning: Matrix and Tensor Frameworks

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...

Score Function Features for Discriminative Learning: Matrix and Tensor Framework

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...

Score Function Features for Discriminative Learning

Feature learning forms the cornerstone for tackling challenging learning problems in domains such as speech, computer vision and natural language processing. In this paper, we consider a novel class of matrix and tensor-valued features, which can be pre-trained using unlabeled samples. We present efficient algorithms for extracting discriminative information, given these pre-trained features an...

Publication date: 2013